Dataset statistics
| Number of variables | 13 |
|---|---|
| Number of observations | 27378 |
| Missing cells | 9811 |
| Missing cells (%) | 2.8% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 10.9 MiB |
| Average record size in memory | 417.4 B |
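The derived figures in this table can be rechecked from the raw counts. The sketch below assumes that "cells" means rows times columns and that 1 MiB is 2**20 bytes; the report does not state either definition, and the memory figure is already rounded, so the record size only matches approximately.

```python
# Recompute the derived "Dataset statistics" figures from the raw counts.
n_variables = 13
n_observations = 27378
missing_cells = 9811
total_size_mib = 10.9  # rounded value as printed in the report

# Assumption: a "cell" is one (row, column) position.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: 1 MiB = 2**20 bytes; the input is rounded, so this is approximate.
avg_record_bytes = total_size_mib * 2**20 / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"average record size: about {avg_record_bytes:.0f} B")
```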
Variable types
| NUM | 7 |
|---|---|
| CAT | 6 |
Warnings
| Warning | Type |
|---|---|
| Budget Line has a high cardinality: 4163 distinct values | High cardinality |
| Budget Line Description has a high cardinality: 1899 distinct values | High cardinality |
| First Fiscal Year is highly correlated with Published Date | High correlation |
| Published Date is highly correlated with First Fiscal Year | High correlation |
| Fiscal Year 2 Amount is highly correlated with Fiscal Year 1 Amount and 2 other fields | High correlation |
| Fiscal Year 1 Amount is highly correlated with Fiscal Year 2 Amount | High correlation |
| Fiscal Year 3 Amount is highly correlated with Fiscal Year 2 Amount and 1 other field | High correlation |
| Fiscal Year 4 Amount is highly correlated with Fiscal Year 2 Amount and 2 other fields | High correlation |
| Fiscal Year 5 Amount is highly correlated with Fiscal Year 4 Amount | High correlation |
| Project Type Description is highly correlated with Project Type | High correlation |
| Project Type is highly correlated with Project Type Description | High correlation |
| Fiscal Year 5 Amount has 9640 (35.2%) missing values | Missing |
| Fiscal Year 1 Amount is highly skewed (γ1 = 102.9564547) | Skewed |
| Fiscal Year 2 Amount is highly skewed (γ1 = 98.30545073) | Skewed |
| Fiscal Year 3 Amount is highly skewed (γ1 = 76.32076244) | Skewed |
| Fiscal Year 4 Amount is highly skewed (γ1 = 67.77666991) | Skewed |
| Fiscal Year 5 Amount is highly skewed (γ1 = 32.34754482) | Skewed |
| Fiscal Year 1 Amount has 10517 (38.4%) zeros | Zeros |
| Fiscal Year 2 Amount has 16259 (59.4%) zeros | Zeros |
| Fiscal Year 3 Amount has 19858 (72.5%) zeros | Zeros |
| Fiscal Year 4 Amount has 21110 (77.1%) zeros | Zeros |
| Fiscal Year 5 Amount has 14676 (53.6%) zeros | Zeros |
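The "Skewed" and "Zeros" warnings can be reproduced on any numeric column. The sketch below assumes that γ1 is the bias-corrected (adjusted Fisher-Pearson) sample skewness, which is the estimator pandas uses for `Series.skew`; the report itself does not say which estimator it applies. The function names and the toy column are hypothetical.

```python
import math


def sample_skewness(values):
    """Bias-corrected sample skewness (adjusted Fisher-Pearson estimator)."""
    n = len(values)
    if n < 3:
        raise ValueError("skewness needs at least three values")
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n  # second central moment
    m3 = sum((v - mean) ** 3 for v in values) / n  # third central moment
    if m2 == 0:
        return 0.0
    g1 = m3 / m2 ** 1.5
    # Small-sample correction factor.
    return g1 * math.sqrt(n * (n - 1)) / (n - 2)


def zero_share(values):
    """Fraction of entries that are exactly zero."""
    return sum(1 for v in values if v == 0) / len(values)


# Hypothetical toy column: mostly zeros plus one large amount.
amounts = [0, 0, 0, 0, 10]
print(sample_skewness(amounts), zero_share(amounts))
```

A column dominated by zeros with a few very large values, as in the fiscal year amounts, produces both warnings at once: a large positive skewness and a high share of zeros.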
Reproduction
| Analysis started | 2020-12-12 20:10:00.983606 |
|---|---|
| Analysis finished | 2020-12-12 20:10:08.783819 |
| Duration | 7.8 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
Published Date
Numeric
| Distinct | 13 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 20182187.54 |
|---|---|
| Minimum | 20160426 |
| Maximum | 20200416 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 214.0 KiB |
Quantile statistics
| Minimum | 20160426 |
|---|---|
| 5-th percentile | 20160426 |
| Q1 | 20170426 |
| median | 20181010 |
| Q3 | 20191025 |
| 95-th percentile | 20200416 |
| Maximum | 20200416 |
| Range | 39990 |
| Interquartile range (IQR) | 20599 |
Descriptive statistics
| Standard deviation | 13415.01729 |
|---|---|
| Coefficient of variation (CV) | 0.0006646958989 |
| Kurtosis | -1.217536075 |
| Mean | 20182187.54 |
| Median Absolute Deviation (MAD) | 10015 |
| Skewness | -0.1158771836 |
| Sum | 5.525277483e+11 |
| Variance | 179962688.9 |
| Monotonicity | Not monotonic |
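The values in this column look like dates stored as YYYYMMDD integers (an inference from values such as 20160426, not something the report states). If so, numeric summaries such as the mean or the range describe the integer encoding rather than elapsed time. A minimal sketch of the difference, with a hypothetical helper name:

```python
from datetime import date, datetime


def parse_yyyymmdd(value: int) -> date:
    """Interpret an integer such as 20160426 as the calendar date 2016-04-26."""
    return datetime.strptime(str(value), "%Y%m%d").date()


minimum, maximum = 20160426, 20200416  # taken from the quantile table

# Difference of the raw integers: this is what the report lists as "Range".
integer_range = maximum - minimum

# Elapsed calendar time between the same two values read as dates.
calendar_span = parse_yyyymmdd(maximum) - parse_yyyymmdd(minimum)

print(integer_range)
print(calendar_span.days)
```

Parsing the column as a date before profiling would make the quantiles and spread directly interpretable.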
Histogram with fixed size bins (bins=13)
Common values
| Value | Count | Frequency (%) | |
| 20200116 | 4106 | 15.0% | |
| 20190207 | 1967 | 7.2% | |
| 20171106 | 1963 | 7.2% | |
| 20181010 | 1960 | 7.2% | |
| 20200416 | 1958 | 7.2% | |
| 20180201 | 1958 | 7.2% | |
| 20170426 | 1957 | 7.1% | |
| 20180426 | 1950 | 7.1% | |
| 20191025 | 1947 | 7.1% | |
| 20190425 | 1946 | 7.1% | |
| 20160426 | 1896 | 6.9% | |
| 20161026 | 1889 | 6.9% | |
| 20170124 | 1880 | 6.9% | |
| (Missing) | 1 | < 0.1% |
Lowest values
| Value | Count | Frequency (%) | |
| 20160426 | 1896 | 6.9% | |
| 20161026 | 1889 | 6.9% | |
| 20170124 | 1880 | 6.9% | |
| 20170426 | 1957 | 7.1% | |
| 20171106 | 1963 | 7.2% | |
| 20180201 | 1958 | 7.2% | |
| 20180426 | 1950 | 7.1% | |
| 20181010 | 1960 | 7.2% | |
| 20190207 | 1967 | 7.2% | |
| 20190425 | 1946 | 7.1% |
Highest values
| Value | Count | Frequency (%) | |
| 20200416 | 1958 | 7.2% | |
| 20200116 | 4106 | 15.0% | |
| 20191025 | 1947 | 7.1% | |
| 20190425 | 1946 | 7.1% | |
| 20190207 | 1967 | 7.2% | |
| 20181010 | 1960 | 7.2% | |
| 20180426 | 1950 | 7.1% | |
| 20180201 | 1958 | 7.2% | |
| 20171106 | 1963 | 7.2% | |
| 20170426 | 1957 | 7.1% |
Project Type
Categorical
| Distinct | 40 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 214.0 KiB |
Common values
| Value | Count | Frequency (%) | |
| PV | 7730 | 28.2% | |
| PW | 2941 | 10.7% | |
| HD | 2081 | 7.6% | |
| P | 1954 | 7.1% | |
| HL | 1804 | 6.6% | |
| HB | 1378 | 5.0% | |
| HW | 1221 | 4.5% | |
| ED | 1151 | 4.2% | |
| HR | 824 | 3.0% | |
| SE | 502 | 1.8% | |
| CS | 481 | 1.8% | |
| HN | 476 | 1.7% | |
| AG | 463 | 1.7% | |
| CO | 373 | 1.4% | |
| E | 351 | 1.3% | |
| PO | 338 | 1.2% | |
| TF | 294 | 1.1% | |
| WP | 287 | 1.0% | |
| F | 281 | 1.0% | |
| S | 263 | 1.0% | |
| HO | 253 | 0.9% | |
| PU | 217 | 0.8% | |
| LN | 216 | 0.8% | |
| HH | 190 | 0.7% | |
| C | 146 | 0.5% | |
| Other values (15) | 1162 | 4.2% |
Frequencies of value counts
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 3 |
|---|---|
| Median length | 2 |
| Mean length | 1.882569947 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| P | 13610 | 26.4% | |
| H | 8519 | 16.5% | |
| V | 7730 | 15.0% | |
| W | 4608 | 8.9% | |
| D | 3310 | 6.4% | |
| L | 2296 | 4.5% | |
| E | 2154 | 4.2% | |
| B | 1613 | 3.1% | |
| S | 1266 | 2.5% | |
| C | 1000 | 1.9% | |
| R | 980 | 1.9% | |
| O | 964 | 1.9% | |
| F | 693 | 1.3% | |
| N | 692 | 1.3% | |
| A | 683 | 1.3% | |
| T | 502 | 1.0% | |
| G | 463 | 0.9% | |
| U | 217 | 0.4% | |
| M | 149 | 0.3% | |
| Q | 89 | 0.2% | |
| n | 2 | < 0.1% | |
| a | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Uppercase Letter | 51538 | > 99.9% | |
| Lowercase Letter | 3 | < 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| P | 13610 | 26.4% | |
| H | 8519 | 16.5% | |
| V | 7730 | 15.0% | |
| W | 4608 | 8.9% | |
| D | 3310 | 6.4% | |
| L | 2296 | 4.5% | |
| E | 2154 | 4.2% | |
| B | 1613 | 3.1% | |
| S | 1266 | 2.5% | |
| C | 1000 | 1.9% | |
| R | 980 | 1.9% | |
| O | 964 | 1.9% | |
| F | 693 | 1.3% | |
| N | 692 | 1.3% | |
| A | 683 | 1.3% | |
| T | 502 | 1.0% | |
| G | 463 | 0.9% | |
| U | 217 | 0.4% | |
| M | 149 | 0.3% | |
| Q | 89 | 0.2% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 2 | 66.7% | |
| a | 1 | 33.3% |
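The three lowercase characters (two `n` and one `a`) are consistent with the single missing value having been counted as the text "nan". This is an inference from the counts, not something the report states. The sketch below checks that reading and also recomputes the mean length from the total character count.

```python
from collections import Counter

n_observations = 27378
total_characters = 51541  # total from the "Most occurring blocks" table

# A missing float renders as the three-character string "nan".
missing_as_text = str(float("nan"))
lowercase_counts = Counter(ch for ch in missing_as_text if ch.islower())

# Mean length, assuming every observation (including the missing one) is counted.
mean_length = total_characters / n_observations

print(missing_as_text, dict(lowercase_counts))
print(round(mean_length, 6))
```

The same three lowercase characters appear in the other categorical variables with exactly one missing value, which supports the same reading there.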
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 51541 | 100.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| P | 13610 | 26.4% | |
| H | 8519 | 16.5% | |
| V | 7730 | 15.0% | |
| W | 4608 | 8.9% | |
| D | 3310 | 6.4% | |
| L | 2296 | 4.5% | |
| E | 2154 | 4.2% | |
| B | 1613 | 3.1% | |
| S | 1266 | 2.5% | |
| C | 1000 | 1.9% | |
| R | 980 | 1.9% | |
| O | 964 | 1.9% | |
| F | 693 | 1.3% | |
| N | 692 | 1.3% | |
| A | 683 | 1.3% | |
| T | 502 | 1.0% | |
| G | 463 | 0.9% | |
| U | 217 | 0.4% | |
| M | 149 | 0.3% | |
| Q | 89 | 0.2% | |
| n | 2 | < 0.1% | |
| a | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 51541 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| P | 13610 | 26.4% | |
| H | 8519 | 16.5% | |
| V | 7730 | 15.0% | |
| W | 4608 | 8.9% | |
| D | 3310 | 6.4% | |
| L | 2296 | 4.5% | |
| E | 2154 | 4.2% | |
| B | 1613 | 3.1% | |
| S | 1266 | 2.5% | |
| C | 1000 | 1.9% | |
| R | 980 | 1.9% | |
| O | 964 | 1.9% | |
| F | 693 | 1.3% | |
| N | 692 | 1.3% | |
| A | 683 | 1.3% | |
| T | 502 | 1.0% | |
| G | 463 | 0.9% | |
| U | 217 | 0.4% | |
| M | 149 | 0.3% | |
| Q | 89 | 0.2% | |
| n | 2 | < 0.1% | |
| a | 1 | < 0.1% |
Project Type Description
Categorical
| Distinct | 41 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 214.0 KiB |
Common values
| Value | Count | Frequency (%) | |
| CULTURAL INSTITUTIONS | 7730 | 28.2% | |
| PUBLIC BUILDINGS | 2941 | 10.7% | |
| HOUSING & DEVELOPMENT | 2081 | 7.6% | |
| PARKS | 1954 | 7.1% | |
| HEALTH | 1804 | 6.6% | |
| HIGHWAY BRIDGES | 1378 | 5.0% | |
| HIGHWAYS | 1221 | 4.5% | |
| ECONOMIC DEVELOPMENT | 1151 | 4.2% | |
| HUMAN RESOURCES | 824 | 3.0% | |
| SEWERS | 502 | 1.8% | |
| ADMIN FOR CHILDREN'S SERVICES | 481 | 1.8% | |
| HIGHER EDUCATION | 476 | 1.7% | |
| DEPARTMENT FOR THE AGING | 463 | 1.7% | |
| COURTS | 373 | 1.4% | |
| EDUCATION | 351 | 1.3% | |
| POLICE | 338 | 1.2% | |
| TRAFFIC | 294 | 1.1% | |
| WATER POLLUTION CONTROL | 287 | 1.0% | |
| FIRE | 281 | 1.0% | |
| SANITATION | 263 | 1.0% | |
| HEALTH & HOSPITALS CORP. | 253 | 0.9% | |
| NEW YORK PUBLIC LIBRARY | 216 | 0.8% | |
| HOMELESS SERVICES | 190 | 0.7% | |
| EDP EQUIP & FINANC COSTS | 178 | 0.7% | |
| CORRECTION | 146 | 0.5% | |
| Other values (16) | 1201 | 4.4% |
Frequencies of value counts
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 31 |
|---|---|
| Median length | 17 |
| Mean length | 15.94141281 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| I | 47280 | 10.8% | |
| T | 42547 | 9.7% | |
| U | 35699 | 8.2% | |
| S | 32297 | 7.4% | |
| N | 31324 | 7.2% | |
| L | 29865 | 6.8% | |
| (space) | 26272 | 6.0% | |
| E | 25214 | 5.8% | |
| O | 22644 | 5.2% | |
| R | 20933 | 4.8% | |
| A | 20679 | 4.7% | |
| C | 18502 | 4.2% | |
| H | 15047 | 3.4% | |
| P | 11167 | 2.6% | |
| G | 10752 | 2.5% | |
| D | 10273 | 2.4% | |
| B | 8547 | 2.0% | |
| M | 6735 | 1.5% | |
| W | 4078 | 0.9% | |
| V | 4021 | 0.9% | |
| Y | 3941 | 0.9% | |
| & | 2669 | 0.6% | |
| K | 2357 | 0.5% | |
| F | 2109 | 0.5% | |
| Q | 499 | 0.1% | |
| Other values (5) | 993 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Uppercase Letter | 406510 | 93.1% | |
| Space Separator | 26272 | 6.0% | |
| Other Punctuation | 3659 | 0.8% | |
| Lowercase Letter | 3 | < 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| I | 47280 | 11.6% | |
| T | 42547 | 10.5% | |
| U | 35699 | 8.8% | |
| S | 32297 | 7.9% | |
| N | 31324 | 7.7% | |
| L | 29865 | 7.3% | |
| E | 25214 | 6.2% | |
| O | 22644 | 5.6% | |
| R | 20933 | 5.1% | |
| A | 20679 | 5.1% | |
| C | 18502 | 4.6% | |
| H | 15047 | 3.7% | |
| P | 11167 | 2.7% | |
| G | 10752 | 2.6% | |
| D | 10273 | 2.5% | |
| B | 8547 | 2.1% | |
| M | 6735 | 1.7% | |
| W | 4078 | 1.0% | |
| V | 4021 | 1.0% | |
| Y | 3941 | 1.0% | |
| K | 2357 | 0.6% | |
| F | 2109 | 0.5% | |
| Q | 499 | 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| (space) | 26272 | 100.0% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| & | 2669 | 72.9% | |
| ' | 481 | 13.1% | |
| . | 470 | 12.8% | |
| , | 39 | 1.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 2 | 66.7% | |
| a | 1 | 33.3% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 406513 | 93.1% | |
| Common | 29931 | 6.9% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| I | 47280 | 11.6% | |
| T | 42547 | 10.5% | |
| U | 35699 | 8.8% | |
| S | 32297 | 7.9% | |
| N | 31324 | 7.7% | |
| L | 29865 | 7.3% | |
| E | 25214 | 6.2% | |
| O | 22644 | 5.6% | |
| R | 20933 | 5.1% | |
| A | 20679 | 5.1% | |
| C | 18502 | 4.6% | |
| H | 15047 | 3.7% | |
| P | 11167 | 2.7% | |
| G | 10752 | 2.6% | |
| D | 10273 | 2.5% | |
| B | 8547 | 2.1% | |
| M | 6735 | 1.7% | |
| W | 4078 | 1.0% | |
| V | 4021 | 1.0% | |
| Y | 3941 | 1.0% | |
| K | 2357 | 0.6% | |
| F | 2109 | 0.5% | |
| Q | 499 | 0.1% | |
| n | 2 | < 0.1% | |
| a | 1 | < 0.1% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| (space) | 26272 | 87.8% | |
| & | 2669 | 8.9% | |
| ' | 481 | 1.6% | |
| . | 470 | 1.6% | |
| , | 39 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 436444 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| I | 47280 | 10.8% | |
| T | 42547 | 9.7% | |
| U | 35699 | 8.2% | |
| S | 32297 | 7.4% | |
| N | 31324 | 7.2% | |
| L | 29865 | 6.8% | |
| (space) | 26272 | 6.0% | |
| E | 25214 | 5.8% | |
| O | 22644 | 5.2% | |
| R | 20933 | 4.8% | |
| A | 20679 | 4.7% | |
| C | 18502 | 4.2% | |
| H | 15047 | 3.4% | |
| P | 11167 | 2.6% | |
| G | 10752 | 2.5% | |
| D | 10273 | 2.4% | |
| B | 8547 | 2.0% | |
| M | 6735 | 1.5% | |
| W | 4078 | 0.9% | |
| V | 4021 | 0.9% | |
| Y | 3941 | 0.9% | |
| & | 2669 | 0.6% | |
| K | 2357 | 0.5% | |
| F | 2109 | 0.5% | |
| Q | 499 | 0.1% | |
| Other values (5) | 993 | 0.2% |
Budget Line
Categorical
| Distinct | 4163 |
|---|---|
| Distinct (%) | 15.2% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 214.0 KiB |
Common values
| Value | Count | Frequency (%) | |
| P 0245K | 24 | 0.1% | |
| PV0175 | 24 | 0.1% | |
| P 0822 | 24 | 0.1% | |
| HB0215 | 24 | 0.1% | |
| P 0245R | 24 | 0.1% | |
| ED0075 | 24 | 0.1% | |
| ED0409 | 24 | 0.1% | |
| P 1008 | 24 | 0.1% | |
| TF0502 | 24 | 0.1% | |
| F 0109 | 24 | 0.1% | |
| HW1684 | 24 | 0.1% | |
| PU0015 | 24 | 0.1% | |
| FA0313 | 24 | 0.1% | |
| C 0075 | 24 | 0.1% | |
| HW0003 | 24 | 0.1% | |
| P 1322 | 24 | 0.1% | |
| HB1203 | 24 | 0.1% | |
| P 1018 | 24 | 0.1% | |
| HW0876 | 24 | 0.1% | |
| HW0349 | 24 | 0.1% | |
| S 0136 | 24 | 0.1% | |
| LB0104 | 24 | 0.1% | |
| HB1012 | 24 | 0.1% | |
| CS0003 | 24 | 0.1% | |
| HB1027 | 24 | 0.1% | |
| Other values (4138) | 26777 | 97.8% |
Frequencies of value counts
Unique
| Unique | 1771 |
|---|---|
| Unique (%) | 6.5% |
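"Distinct" and "Unique" measure different things in these tables: distinct counts how many different values occur, while unique appears to count the values that occur exactly once (1771 of the 4163 distinct budget lines). A minimal sketch of that reading, using a hypothetical helper and a toy sample built from codes in the table above:

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique


# Toy sample; the real column has 27378 observations.
sample = ["P 0822", "P 0822", "HB0215", "ED0075", "ED0075", "TF0502"]
print(distinct_and_unique(sample))

# The percentages in the report are taken over all observations.
n_observations = 27378
distinct_pct = 100 * 4163 / n_observations
unique_pct = 100 * 1771 / n_observations
print(f"{distinct_pct:.1f}% distinct, {unique_pct:.1f}% unique")
```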
Histogram of lengths of the category
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 6.594601505 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 0 | 24613 | 13.6% | |
| N | 14321 | 7.9% | |
| P | 13627 | 7.5% | |
| 1 | 12182 | 6.7% | |
| D | 11522 | 6.4% | |
| 2 | 9804 | 5.4% | |
| H | 8544 | 4.7% | |
| V | 7735 | 4.3% | |
| 4 | 7370 | 4.1% | |
| 3 | 7219 | 4.0% | |
| 7 | 7205 | 4.0% | |
| 6 | 6615 | 3.7% | |
| 9 | 6155 | 3.4% | |
| 8 | 6018 | 3.3% | |
| 5 | 6011 | 3.3% | |
| W | 4632 | 2.6% | |
| (space) | 3216 | 1.8% | |
| M | 2843 | 1.6% | |
| L | 2315 | 1.3% | |
| E | 2165 | 1.2% | |
| K | 2014 | 1.1% | |
| - | 1960 | 1.1% | |
| R | 1637 | 0.9% | |
| B | 1628 | 0.9% | |
| Q | 1564 | 0.9% | |
| Other values (15) | 7632 | 4.2% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 93192 | 51.6% | |
| Uppercase Letter | 82176 | 45.5% | |
| Space Separator | 3216 | 1.8% | |
| Dash Punctuation | 1960 | 1.1% | |
| Lowercase Letter | 3 | < 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| N | 14321 | 17.4% | |
| P | 13627 | 16.6% | |
| D | 11522 | 14.0% | |
| H | 8544 | 10.4% | |
| V | 7735 | 9.4% | |
| W | 4632 | 5.6% | |
| M | 2843 | 3.5% | |
| L | 2315 | 2.8% | |
| E | 2165 | 2.6% | |
| K | 2014 | 2.5% | |
| R | 1637 | 2.0% | |
| B | 1628 | 2.0% | |
| Q | 1564 | 1.9% | |
| X | 1351 | 1.6% | |
| C | 1334 | 1.6% | |
| S | 1275 | 1.6% | |
| O | 969 | 1.2% | |
| A | 756 | 0.9% | |
| F | 704 | 0.9% | |
| T | 508 | 0.6% | |
| G | 474 | 0.6% | |
| U | 223 | 0.3% | |
| J | 16 | < 0.1% | |
| Z | 9 | < 0.1% | |
| Y | 6 | < 0.1% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 0 | 24613 | 26.4% | |
| 1 | 12182 | 13.1% | |
| 2 | 9804 | 10.5% | |
| 4 | 7370 | 7.9% | |
| 3 | 7219 | 7.7% | |
| 7 | 7205 | 7.7% | |
| 6 | 6615 | 7.1% | |
| 9 | 6155 | 6.6% | |
| 8 | 6018 | 6.5% | |
| 5 | 6011 | 6.5% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| (space) | 3216 | 100.0% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 2 | 66.7% | |
| a | 1 | 33.3% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 1960 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 98368 | 54.5% | |
| Latin | 82179 | 45.5% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| N | 14321 | 17.4% | |
| P | 13627 | 16.6% | |
| D | 11522 | 14.0% | |
| H | 8544 | 10.4% | |
| V | 7735 | 9.4% | |
| W | 4632 | 5.6% | |
| M | 2843 | 3.5% | |
| L | 2315 | 2.8% | |
| E | 2165 | 2.6% | |
| K | 2014 | 2.5% | |
| R | 1637 | 2.0% | |
| B | 1628 | 2.0% | |
| Q | 1564 | 1.9% | |
| X | 1351 | 1.6% | |
| C | 1334 | 1.6% | |
| S | 1275 | 1.6% | |
| O | 969 | 1.2% | |
| A | 756 | 0.9% | |
| F | 704 | 0.9% | |
| T | 508 | 0.6% | |
| G | 474 | 0.6% | |
| U | 223 | 0.3% | |
| J | 16 | < 0.1% | |
| Z | 9 | < 0.1% | |
| Y | 6 | < 0.1% | |
| Other values (3) | 7 | < 0.1% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 0 | 24613 | 25.0% | |
| 1 | 12182 | 12.4% | |
| 2 | 9804 | 10.0% | |
| 4 | 7370 | 7.5% | |
| 3 | 7219 | 7.3% | |
| 7 | 7205 | 7.3% | |
| 6 | 6615 | 6.7% | |
| 9 | 6155 | 6.3% | |
| 8 | 6018 | 6.1% | |
| 5 | 6011 | 6.1% | |
| (space) | 3216 | 3.3% | |
| - | 1960 | 2.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 180547 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 0 | 24613 | 13.6% | |
| N | 14321 | 7.9% | |
| P | 13627 | 7.5% | |
| 1 | 12182 | 6.7% | |
| D | 11522 | 6.4% | |
| 2 | 9804 | 5.4% | |
| H | 8544 | 4.7% | |
| V | 7735 | 4.3% | |
| 4 | 7370 | 4.1% | |
| 3 | 7219 | 4.0% | |
| 7 | 7205 | 4.0% | |
| 6 | 6615 | 3.7% | |
| 9 | 6155 | 3.4% | |
| 8 | 6018 | 3.3% | |
| 5 | 6011 | 3.3% | |
| W | 4632 | 2.6% | |
| (space) | 3216 | 1.8% | |
| M | 2843 | 1.6% | |
| L | 2315 | 1.3% | |
| E | 2165 | 1.2% | |
| K | 2014 | 1.1% | |
| - | 1960 | 1.1% | |
| R | 1637 | 0.9% | |
| B | 1628 | 0.9% | |
| Q | 1564 | 0.9% | |
| Other values (15) | 7632 | 4.2% |
Budget Line Description
Categorical
| Distinct | 1899 |
|---|---|
| Distinct (%) | 7.0% |
| Missing | 165 |
| Missing (%) | 0.6% |
| Memory size | 214.0 KiB |
Common values
| Value | Count | Frequency (%) | |
| MISCELLANEOUS PARKS, PLAYGROUNDS, CONSTRUCTION, RECONSTRUCTI | 123 | 0.4% | |
| CONSTRUCTION OR ACQUISITION OF A NON-CITY OWNED PUBLIC BETTERMENT | 98 | 0.4% | |
| CONSTRUCTION AND IMPROVEMENTS TO CUNY COMMUNITY COLLEGES, CITYWIDE | 66 | 0.2% | |
| FIVE YEAR EDUCATIONAL FACILITIES CAPITAL PLAN | 65 | 0.2% | |
| CONSTRUCTION, IMPROVEMENTS, ACQUISITION, ALL CULTURAL INSTITUTIONS | 62 | 0.2% | |
| SEVENTH REGIMENT ARMORY CONSERVANCY | 56 | 0.2% | |
| HENRY STREET SETTLEMENT | 56 | 0.2% | |
| QUALITY SERVICES FOR THE AUTISM COMMUNITY INC. (QSAC) | 53 | 0.2% | |
| MUSEUM OF CITY OF N. Y. IMPROVEMENTS | 51 | 0.2% | |
| CITY HARVEST, INC. | 51 | 0.2% | |
| JAMAICA ARTS CENTER, RECONSTRUCTION AND IMPROVEMENTS | 50 | 0.2% | |
| PLANNED PARENTHOOD OF NEW YORK CITY | 50 | 0.2% | |
| GOD'S LOVE WE DELIVER, INC. | 49 | 0.2% | |
| ABC NO RIO | 48 | 0.2% | |
| EDUCATIONAL ALLIANCE | 48 | 0.2% | |
| EYEBEAM, INC. | 47 | 0.2% | |
| NEW YORK RESTORATION PROJECT (NYRP) | 47 | 0.2% | |
| PREGONES THEATER | 46 | 0.2% | |
| SOUTH STREET SEAPORT MUSEUM | 45 | 0.2% | |
| JEWISH BOARD OF FAMILY AND CHILDREN'S SERVICES | 44 | 0.2% | |
| GOOD SHEPHERD SERVICES | 44 | 0.2% | |
| ASIAN AMERICANS FOR EQUALITY, INC. (AAFE) | 43 | 0.2% | |
| BALLET HISPANICO | 42 | 0.2% | |
| ST. ANN'S WAREHOUSE/ARTS AT ST. ANN'S | 42 | 0.2% | |
| MUSEUM OF THE MOVING IMAGE, THE AMERICAN | 42 | 0.2% | |
| Other values (1874) | 25845 | 94.4% | |
| (Missing) | 165 | 0.6% |
Frequencies of value counts
Unique
| Unique | 68 |
|---|---|
| Unique (%) | 0.2% |
Histogram of lengths of the category
Length
| Max length | 70 |
|---|---|
| Median length | 37 |
| Mean length | 39.05285266 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| (space) | 121254 | 11.3% | |
| E | 95922 | 9.0% | |
| N | 76984 | 7.2% | |
| T | 76666 | 7.2% | |
| O | 72886 | 6.8% | |
| I | 70721 | 6.6% | |
| R | 70502 | 6.6% | |
| A | 64517 | 6.0% | |
| S | 57075 | 5.3% | |
| C | 52102 | 4.9% | |
| L | 34770 | 3.3% | |
| U | 30699 | 2.9% | |
| M | 30466 | 2.8% | |
| D | 25576 | 2.4% | |
| H | 24781 | 2.3% | |
| P | 21949 | 2.1% | |
| Y | 18026 | 1.7% | |
| , | 17570 | 1.6% | |
| F | 15482 | 1.4% | |
| B | 14004 | 1.3% | |
| G | 13080 | 1.2% | |
| V | 11699 | 1.1% | |
| W | 11159 | 1.0% | |
| K | 8214 | 0.8% | |
| . | 7640 | 0.7% | |
| Other values (33) | 25445 | 2.4% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Uppercase Letter | 906075 | 84.7% | |
| Space Separator | 121254 | 11.3% | |
| Other Punctuation | 30078 | 2.8% | |
| Decimal Number | 6564 | 0.6% | |
| Dash Punctuation | 1948 | 0.2% | |
| Open Punctuation | 1340 | 0.1% | |
| Close Punctuation | 1328 | 0.1% | |
| Lowercase Letter | 535 | 0.1% | |
| Currency Symbol | 46 | < 0.1% | |
| Control | 21 | < 0.1% |
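The category names in this table match Unicode general categories. As a rough illustration, Python's standard `unicodedata` module exposes the same classification through two-letter codes; the mapping to the long names below is written out by hand for this sketch, and the helper name is hypothetical.

```python
import unicodedata
from collections import Counter

# Two-letter Unicode general category codes and the long names used in the table.
CATEGORY_NAMES = {
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Po": "Other Punctuation",
    "Pd": "Dash Punctuation",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Sc": "Currency Symbol",
    "Cc": "Control",
}


def category_counts(text):
    """Count characters per Unicode general category, using long names where known."""
    codes = (unicodedata.category(ch) for ch in text)
    return Counter(CATEGORY_NAMES.get(code, code) for code in codes)


# One of the descriptions listed under the common values above.
sample = "NEW YORK RESTORATION PROJECT (NYRP)"
for name, count in category_counts(sample).most_common():
    print(f"{name}: {count}")
```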
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 330 | 61.7% | |
| a | 165 | 30.8% | |
| t | 14 | 2.6% | |
| h | 14 | 2.6% | |
| o | 8 | 1.5% | |
| x | 4 | 0.7% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| E | 95922 | 10.6% | |
| N | 76984 | 8.5% | |
| T | 76666 | 8.5% | |
| O | 72886 | 8.0% | |
| I | 70721 | 7.8% | |
| R | 70502 | 7.8% | |
| A | 64517 | 7.1% | |
| S | 57075 | 6.3% | |
| C | 52102 | 5.8% | |
| L | 34770 | 3.8% | |
| U | 30699 | 3.4% | |
| M | 30466 | 3.4% | |
| D | 25576 | 2.8% | |
| H | 24781 | 2.7% | |
| P | 21949 | 2.4% | |
| Y | 18026 | 2.0% | |
| F | 15482 | 1.7% | |
| B | 14004 | 1.5% | |
| G | 13080 | 1.4% | |
| V | 11699 | 1.3% | |
| W | 11159 | 1.2% | |
| K | 8214 | 0.9% | |
| Q | 4075 | 0.4% | |
| J | 1914 | 0.2% | |
| X | 1890 | 0.2% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 1948 | 100.0% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| (space) | 121254 | 100.0% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| , | 17570 | 58.4% | |
| . | 7640 | 25.4% | |
| & | 1816 | 6.0% | |
| / | 1714 | 5.7% | |
| ' | 1055 | 3.5% | |
| : | 183 | 0.6% | |
| " | 56 | 0.2% | |
| # | 20 | 0.1% | |
| ; | 12 | < 0.1% | |
| ! | 12 | < 0.1% |
Most frequent Open Punctuation characters
| Value | Count | Frequency (%) | |
| ( | 1340 | 100.0% |
Most frequent Close Punctuation characters
| Value | Count | Frequency (%) | |
| ) | 1328 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 1 | 1473 | 22.4% | |
| 2 | 967 | 14.7% | |
| 0 | 804 | 12.2% | |
| 3 | 674 | 10.3% | |
| 5 | 596 | 9.1% | |
| 4 | 519 | 7.9% | |
| 8 | 440 | 6.7% | |
| 9 | 425 | 6.5% | |
| 7 | 367 | 5.6% | |
| 6 | 299 | 4.6% |
Most frequent Control characters
| Value | Count | Frequency (%) | |
| (control character) | 21 | 100.0% |
Most frequent Currency Symbol characters
| Value | Count | Frequency (%) | |
| $ | 46 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 906610 | 84.8% | |
| Common | 162579 | 15.2% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| E | 95922 | 10.6% | |
| N | 76984 | 8.5% | |
| T | 76666 | 8.5% | |
| O | 72886 | 8.0% | |
| I | 70721 | 7.8% | |
| R | 70502 | 7.8% | |
| A | 64517 | 7.1% | |
| S | 57075 | 6.3% | |
| C | 52102 | 5.7% | |
| L | 34770 | 3.8% | |
| U | 30699 | 3.4% | |
| M | 30466 | 3.4% | |
| D | 25576 | 2.8% | |
| H | 24781 | 2.7% | |
| P | 21949 | 2.4% | |
| Y | 18026 | 2.0% | |
| F | 15482 | 1.7% | |
| B | 14004 | 1.5% | |
| G | 13080 | 1.4% | |
| V | 11699 | 1.3% | |
| W | 11159 | 1.2% | |
| K | 8214 | 0.9% | |
| Q | 4075 | 0.4% | |
| J | 1914 | 0.2% | |
| X | 1890 | 0.2% | |
| Other values (7) | 1451 | 0.2% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| (space) | 121254 | 74.6% | |
| , | 17570 | 10.8% | |
| . | 7640 | 4.7% | |
| - | 1948 | 1.2% | |
| & | 1816 | 1.1% | |
| / | 1714 | 1.1% | |
| 1 | 1473 | 0.9% | |
| ( | 1340 | 0.8% | |
| ) | 1328 | 0.8% | |
| ' | 1055 | 0.6% | |
| 2 | 967 | 0.6% | |
| 0 | 804 | 0.5% | |
| 3 | 674 | 0.4% | |
| 5 | 596 | 0.4% | |
| 4 | 519 | 0.3% | |
| 8 | 440 | 0.3% | |
| 9 | 425 | 0.3% | |
| 7 | 367 | 0.2% | |
| 6 | 299 | 0.2% | |
| : | 183 | 0.1% | |
| " | 56 | < 0.1% | |
| $ | 46 | < 0.1% | |
| (control character) | 21 | < 0.1% | |
| # | 20 | < 0.1% | |
| ; | 12 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 1069189 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| (space) | 121254 | 11.3% | |
| E | 95922 | 9.0% | |
| N | 76984 | 7.2% | |
| T | 76666 | 7.2% | |
| O | 72886 | 6.8% | |
| I | 70721 | 6.6% | |
| R | 70502 | 6.6% | |
| A | 64517 | 6.0% | |
| S | 57075 | 5.3% | |
| C | 52102 | 4.9% | |
| L | 34770 | 3.3% | |
| U | 30699 | 2.9% | |
| M | 30466 | 2.8% | |
| D | 25576 | 2.4% | |
| H | 24781 | 2.3% | |
| P | 21949 | 2.1% | |
| Y | 18026 | 1.7% | |
| , | 17570 | 1.6% | |
| F | 15482 | 1.4% | |
| B | 14004 | 1.3% | |
| G | 13080 | 1.2% | |
| V | 11699 | 1.1% | |
| W | 11159 | 1.0% | |
| K | 8214 | 0.8% | |
| . | 7640 | 0.7% | |
| Other values (33) | 25445 | 2.4% |
Funding Type
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 214.0 KiB |
Common values
| Value | Count | Frequency (%) | |
| CITY | 23418 | 85.5% | |
| NON CITY | 3959 | 14.5% | |
| (Missing) | 1 | < 0.1% |
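The frequency column is each count divided by the total number of observations, with very small shares shown as "< 0.1%". A minimal sketch that rebuilds the column from the counts above; the formatting rule is an assumption read off the table, and the helper name is hypothetical.

```python
counts = {"CITY": 23418, "NON CITY": 3959, "(Missing)": 1}
total = sum(counts.values())  # should equal the number of observations


def format_share(count, total):
    """Format a share as in the table; the '< 0.1%' cutoff is an assumed rule."""
    pct = 100 * count / total
    if 0 < pct < 0.1:
        return "< 0.1%"
    return f"{pct:.1f}%"


for value, count in counts.items():
    print(f"| {value} | {count} | {format_share(count, total)} |")
```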
Frequencies of value counts
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 8 |
|---|---|
| Median length | 4 |
| Mean length | 4.578384104 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| C | 27377 | 21.8% | |
| I | 27377 | 21.8% | |
| T | 27377 | 21.8% | |
| Y | 27377 | 21.8% | |
| N | 7918 | 6.3% | |
| O | 3959 | 3.2% | |
| (space) | 3959 | 3.2% | |
| n | 2 | < 0.1% | |
| a | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Uppercase Letter | 121385 | 96.8% | |
| Space Separator | 3959 | 3.2% | |
| Lowercase Letter | 3 | < 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| C | 27377 | 22.6% | |
| I | 27377 | 22.6% | |
| T | 27377 | 22.6% | |
| Y | 27377 | 22.6% | |
| N | 7918 | 6.5% | |
| O | 3959 | 3.3% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| (space) | 3959 | 100.0% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 2 | 66.7% | |
| a | 1 | 33.3% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 121388 | 96.8% | |
| Common | 3959 | 3.2% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| C | 27377 | 22.6% | |
| I | 27377 | 22.6% | |
| T | 27377 | 22.6% | |
| Y | 27377 | 22.6% | |
| N | 7918 | 6.5% | |
| O | 3959 | 3.3% | |
| n | 2 | < 0.1% | |
| a | 1 | < 0.1% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| (space) | 3959 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 125347 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| C | 27377 | 21.8% | |
| I | 27377 | 21.8% | |
| T | 27377 | 21.8% | |
| Y | 27377 | 21.8% | |
| N | 7918 | 6.3% | |
| O | 3959 | 3.2% | |
| (space) | 3959 | 3.2% | |
| n | 2 | < 0.1% | |
| a | 1 | < 0.1% |
Number of Years Presented
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 214.0 KiB |
Common values
| Value | Count | Frequency (%) | |
| 5 | 17738 | 64.8% | |
| 4 | 9640 | 35.2% |
Frequencies of value counts
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 5 | 17738 | 64.8% | |
| 4 | 9640 | 35.2% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 27378 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 5 | 17738 | 64.8% | |
| 4 | 9640 | 35.2% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 27378 | 100.0% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 5 | 17738 | 64.8% | |
| 4 | 9640 | 35.2% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 27378 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 5 | 17738 | 64.8% | |
| 4 | 9640 | 35.2% |
First Fiscal Year
Numeric
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2018.452095 |
|---|---|
| Minimum | 2016 |
| Maximum | 2020 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 214.0 KiB |
Quantile statistics
| Minimum | 2016 |
|---|---|
| 5-th percentile | 2016 |
| Q1 | 2017 |
| median | 2019 |
| Q3 | 2020 |
| 95-th percentile | 2020 |
| Maximum | 2020 |
| Range | 4 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.291061728 |
|---|---|
| Coefficient of variation (CV) | 0.0006396296107 |
| Kurtosis | -1.149156571 |
| Mean | 2018.452095 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.2606054237 |
| Sum | 55259163 |
| Variance | 1.666840384 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=5)
Common values
| Value | Count | Frequency (%) | |
| 2020 | 8011 | 29.3% | |
| 2019 | 5873 | 21.5% | |
| 2018 | 5871 | 21.4% | |
| 2017 | 5726 | 20.9% | |
| 2016 | 1896 | 6.9% | |
| (Missing) | 1 | < 0.1% |
Lowest values
| Value | Count | Frequency (%) | |
| 2016 | 1896 | 6.9% | |
| 2017 | 5726 | 20.9% | |
| 2018 | 5871 | 21.4% | |
| 2019 | 5873 | 21.5% | |
| 2020 | 8011 | 29.3% |
Highest values
| Value | Count | Frequency (%) | |
| 2020 | 8011 | 29.3% | |
| 2019 | 5873 | 21.5% | |
| 2018 | 5871 | 21.4% | |
| 2017 | 5726 | 20.9% | |
| 2016 | 1896 | 6.9% |
Fiscal Year 1 Amount
Numeric
| Distinct | 5388 |
|---|---|
| Distinct (%) | 19.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9730.912521 |
|---|---|
| Minimum | -50216 |
| Maximum | 19242283 |
| Zeros | 10517 |
| Zeros (%) | 38.4% |
| Memory size | 214.0 KiB |
Quantile statistics
| Minimum | -50216 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 77 |
| Q3 | 1239 |
| 95-th percentile | 32175.9 |
| Maximum | 19242283 |
| Range | 19292499 |
| Interquartile range (IQR) | 1239 |
Descriptive statistics
| Standard deviation | 138384.8173 |
|---|---|
| Coefficient of variation (CV) | 14.22115521 |
| Kurtosis | 13731.3725 |
| Mean | 9730.912521 |
| Median Absolute Deviation (MAD) | 77 |
| Skewness | 102.9564547 |
| Sum | 266412923 |
| Variance | 1.915035766e+10 |
| Monotonicity | Not monotonic |
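Several entries in this table are functions of the others, which gives a quick consistency check. The sketch assumes the mean is taken over all 27378 observations (the column has no missing values) and uses the rounded figures as printed.

```python
import math

n_observations = 27378
total = 266412923    # "Sum"
std = 138384.8173    # "Standard deviation"

mean = total / n_observations  # should match "Mean"
cv = std / mean                # coefficient of variation = std / mean
variance = std ** 2            # variance = std squared

print(f"mean     = {mean:.6f}")
print(f"CV       = {cv:.6f}")
print(f"variance = {variance:.6e}")
```

A mean near 9731 next to a median of 77 is another view of the heavy right skew flagged for this column.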
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) | |
| 0 | 10517 | 38.4% | |
| 500 | 417 | 1.5% | |
| 100 | 213 | 0.8% | |
| 250 | 200 | 0.7% | |
| 1000 | 192 | 0.7% | |
| 50 | 179 | 0.7% | |
| 1 | 178 | 0.7% | |
| 35 | 119 | 0.4% | |
| 300 | 106 | 0.4% | |
| 200 | 101 | 0.4% | |
| 400 | 98 | 0.4% | |
| 150 | 95 | 0.3% | |
| 2 | 85 | 0.3% | |
| 40 | 78 | 0.3% | |
| 2000 | 75 | 0.3% | |
| 36 | 74 | 0.3% | |
| 800 | 67 | 0.2% | |
| 1500 | 67 | 0.2% | |
| 750 | 66 | 0.2% | |
| 44 | 66 | 0.2% | |
| 350 | 65 | 0.2% | |
| 60 | 64 | 0.2% | |
| 30 | 60 | 0.2% | |
| 3000 | 54 | 0.2% | |
| 600 | 50 | 0.2% | |
| Other values (5363) | 14092 | 51.5% |
Lowest values
| Value | Count | Frequency (%) | |
| -50216 | 1 | < 0.1% | |
| -40675 | 9 | < 0.1% | |
| -40674 | 3 | < 0.1% | |
| -34904 | 2 | < 0.1% | |
| -30077 | 12 | < 0.1% | |
| -28707 | 1 | < 0.1% | |
| -22587 | 1 | < 0.1% | |
| -22584 | 1 | < 0.1% | |
| -22254 | 1 | < 0.1% | |
| -22232 | 1 | < 0.1% |
Highest values
| Value | Count | Frequency (%) | |
| 19242283 | 1 | < 0.1% | |
| 3659433 | 1 | < 0.1% | |
| 3656015 | 1 | < 0.1% | |
| 3254645 | 1 | < 0.1% | |
| 3200720 | 2 | < 0.1% | |
| 2800720 | 1 | < 0.1% | |
| 2734862 | 1 | < 0.1% | |
| 2711902 | 1 | < 0.1% | |
| 2600720 | 1 | < 0.1% | |
| 2581220 | 2 | < 0.1% |
| Distinct | 4625 |
|---|---|
| Distinct (%) | 16.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9508.901381 |
|---|---|
| Minimum | -5799 |
| Maximum | 17161904 |
| Zeros | 16259 |
| Zeros (%) | 59.4% |
| Memory size | 214.0 KiB |
Quantile statistics
| Minimum | -5799 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 583.75 |
| 95-th percentile | 31006.9 |
| Maximum | 17161904 |
| Range | 17167703 |
| Interquartile range (IQR) | 583.75 |
Descriptive statistics
| Standard deviation | 125544.7638 |
|---|---|
| Coefficient of variation (CV) | 13.20286738 |
| Kurtosis | 12836.26011 |
| Mean | 9508.901381 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 98.30545073 |
| Sum | 260334702 |
| Variance | 1.576148772e+10 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 16259 | 59.4% | |
| 500 | 256 | 0.9% | |
| 100 | 195 | 0.7% | |
| 1000 | 172 | 0.6% | |
| 250 | 144 | 0.5% | |
| 200 | 106 | 0.4% | |
| 50 | 78 | 0.3% | |
| 1 | 70 | 0.3% | |
| 150 | 68 | 0.2% | |
| 2000 | 66 | 0.2% | |
| 750 | 53 | 0.2% | |
| 400 | 51 | 0.2% | |
| 300 | 50 | 0.2% | |
| 1500 | 50 | 0.2% | |
| 35 | 45 | 0.2% | |
| 2 | 41 | 0.1% | |
| 40 | 39 | 0.1% | |
| 4000 | 38 | 0.1% | |
| 3 | 35 | 0.1% | |
| 4 | 35 | 0.1% | |
| 350 | 34 | 0.1% | |
| 46 | 33 | 0.1% | |
| 3000 | 33 | 0.1% | |
| 5000 | 33 | 0.1% | |
| 36 | 31 | 0.1% | |
| Other values (4600) | 9363 | 34.2% |
| Value | Count | Frequency (%) | |
|---|---|---|---|
| -5799 | 1 | < 0.1% | |
| -300 | 1 | < 0.1% | |
| -16 | 1 | < 0.1% | |
| -13 | 1 | < 0.1% | |
| 0 | 16259 | 59.4% | |
| 1 | 70 | 0.3% | |
| 2 | 41 | 0.1% | |
| 3 | 35 | 0.1% | |
| 4 | 35 | 0.1% | |
| 5 | 15 | 0.1% |
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 17161904 | 1 | < 0.1% | |
| 3193120 | 1 | < 0.1% | |
| 3172620 | 1 | < 0.1% | |
| 3124208 | 1 | < 0.1% | |
| 3003570 | 1 | < 0.1% | |
| 2970879 | 1 | < 0.1% | |
| 2737879 | 1 | < 0.1% | |
| 2552820 | 1 | < 0.1% | |
| 2514930 | 1 | < 0.1% | |
| 2504770 | 1 | < 0.1% |
| Distinct | 3266 |
|---|---|
| Distinct (%) | 11.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7966.039667 |
|---|---|
| Minimum | 0 |
| Maximum | 12257537 |
| Zeros | 19858 |
| Zeros (%) | 72.5% |
| Memory size | 214.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 50 |
| 95-th percentile | 24670.3 |
| Maximum | 12257537 |
| Range | 12257537 |
| Interquartile range (IQR) | 50 |
Descriptive statistics
| Standard deviation | 100105.2329 |
|---|---|
| Coefficient of variation (CV) | 12.56649943 |
| Kurtosis | 8445.795657 |
| Mean | 7966.039667 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 76.32076244 |
| Sum | 218094234 |
| Variance | 1.002105766e+10 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 19858 | 72.5% | |
| 500 | 170 | 0.6% | |
| 1000 | 132 | 0.5% | |
| 100 | 96 | 0.4% | |
| 250 | 90 | 0.3% | |
| 200 | 71 | 0.3% | |
| 2000 | 60 | 0.2% | |
| 1 | 55 | 0.2% | |
| 50 | 47 | 0.2% | |
| 1500 | 36 | 0.1% | |
| 300 | 35 | 0.1% | |
| 400 | 35 | 0.1% | |
| 750 | 33 | 0.1% | |
| 150 | 33 | 0.1% | |
| 700 | 33 | 0.1% | |
| 4000 | 31 | 0.1% | |
| 35 | 31 | 0.1% | |
| 2 | 31 | 0.1% | |
| 600 | 28 | 0.1% | |
| 5 | 28 | 0.1% | |
| 3 | 28 | 0.1% | |
| 52 | 26 | 0.1% | |
| 40 | 25 | 0.1% | |
| 5000 | 25 | 0.1% | |
| 450 | 23 | 0.1% | |
| Other values (3241) | 6318 | 23.1% |
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 19858 | 72.5% | |
| 1 | 55 | 0.2% | |
| 2 | 31 | 0.1% | |
| 3 | 28 | 0.1% | |
| 4 | 19 | 0.1% | |
| 5 | 28 | 0.1% | |
| 6 | 8 | < 0.1% | |
| 7 | 9 | < 0.1% | |
| 8 | 15 | 0.1% | |
| 9 | 19 | 0.1% |
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 12257537 | 1 | < 0.1% | |
| 3509320 | 1 | < 0.1% | |
| 3431320 | 1 | < 0.1% | |
| 3105320 | 1 | < 0.1% | |
| 2707879 | 1 | < 0.1% | |
| 2705379 | 1 | < 0.1% | |
| 2697679 | 1 | < 0.1% | |
| 2552820 | 1 | < 0.1% | |
| 2502820 | 1 | < 0.1% | |
| 2195569 | 1 | < 0.1% |
| Distinct | 2659 |
|---|---|
| Distinct (%) | 9.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7158.49474 |
|---|---|
| Minimum | -2494 |
| Maximum | 11175734 |
| Zeros | 21110 |
| Zeros (%) | 77.1% |
| Memory size | 214.0 KiB |
Quantile statistics
| Minimum | -2494 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 17050.25 |
| Maximum | 11175734 |
| Range | 11178228 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 97088.433 |
|---|---|
| Coefficient of variation (CV) | 13.56268832 |
| Kurtosis | 6759.432391 |
| Mean | 7158.49474 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 67.77666991 |
| Sum | 195985269 |
| Variance | 9426163822 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 21110 | 77.1% | |
| 500 | 127 | 0.5% | |
| 1000 | 97 | 0.4% | |
| 100 | 82 | 0.3% | |
| 250 | 63 | 0.2% | |
| 200 | 62 | 0.2% | |
| 1 | 50 | 0.2% | |
| 50 | 47 | 0.2% | |
| 2 | 41 | 0.1% | |
| 2000 | 40 | 0.1% | |
| 5 | 39 | 0.1% | |
| 3 | 38 | 0.1% | |
| 300 | 34 | 0.1% | |
| 10 | 28 | 0.1% | |
| 400 | 28 | 0.1% | |
| 750 | 27 | 0.1% | |
| 5000 | 25 | 0.1% | |
| 3000 | 25 | 0.1% | |
| 150 | 24 | 0.1% | |
| 4000 | 24 | 0.1% | |
| 600 | 23 | 0.1% | |
| 1500 | 22 | 0.1% | |
| 25 | 22 | 0.1% | |
| 35 | 22 | 0.1% | |
| 36 | 19 | 0.1% | |
| Other values (2634) | 5259 | 19.2% |
| Value | Count | Frequency (%) | |
|---|---|---|---|
| -2494 | 1 | < 0.1% | |
| 0 | 21110 | 77.1% | |
| 1 | 50 | 0.2% | |
| 2 | 41 | 0.1% | |
| 3 | 38 | 0.1% | |
| 4 | 10 | < 0.1% | |
| 5 | 39 | 0.1% | |
| 6 | 16 | 0.1% | |
| 7 | 15 | 0.1% | |
| 8 | 14 | 0.1% |
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 11175734 | 1 | < 0.1% | |
| 3588380 | 1 | < 0.1% | |
| 3431320 | 1 | < 0.1% | |
| 3426320 | 1 | < 0.1% | |
| 3414380 | 1 | < 0.1% | |
| 3088380 | 1 | < 0.1% | |
| 2692620 | 1 | < 0.1% | |
| 2226316 | 2 | < 0.1% | |
| 2193215 | 1 | < 0.1% | |
| 2165569 | 3 | < 0.1% |
| Distinct | 1594 |
|---|---|
| Distinct (%) | 9.0% |
| Missing | 9640 |
| Missing (%) | 35.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5955.045383 |
|---|---|
| Minimum | 0 |
| Maximum | 3414380 |
| Zeros | 14676 |
| Zeros (%) | 53.6% |
| Memory size | 214.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 10355.35 |
| Maximum | 3414380 |
| Range | 3414380 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 70000.65979 |
|---|---|
| Coefficient of variation (CV) | 11.75484909 |
| Kurtosis | 1280.733389 |
| Mean | 5955.045383 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 32.34754482 |
| Sum | 105630595 |
| Variance | 4900092372 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 14676 | 53.6% | |
| 1000 | 47 | 0.2% | |
| 50 | 36 | 0.1% | |
| 100 | 35 | 0.1% | |
| 2000 | 31 | 0.1% | |
| 500 | 31 | 0.1% | |
| 1 | 24 | 0.1% | |
| 250 | 22 | 0.1% | |
| 4000 | 19 | 0.1% | |
| 2 | 19 | 0.1% | |
| 300 | 18 | 0.1% | |
| 6000 | 18 | 0.1% | |
| 3000 | 17 | 0.1% | |
| 10000 | 16 | 0.1% | |
| 750 | 14 | 0.1% | |
| 25 | 14 | 0.1% | |
| 25000 | 14 | 0.1% | |
| 150 | 13 | < 0.1% | |
| 3 | 13 | < 0.1% | |
| 700 | 13 | < 0.1% | |
| 12000 | 12 | < 0.1% | |
| 400 | 12 | < 0.1% | |
| 200 | 12 | < 0.1% | |
| 10 | 12 | < 0.1% | |
| 40 | 12 | < 0.1% | |
| Other values (1569) | 2588 | 9.5% | |
| (Missing) | 9640 | 35.2% |
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 14676 | 53.6% | |
| 1 | 24 | 0.1% | |
| 2 | 19 | 0.1% | |
| 3 | 13 | < 0.1% | |
| 4 | 6 | < 0.1% | |
| 5 | 11 | < 0.1% | |
| 6 | 6 | < 0.1% | |
| 7 | 9 | < 0.1% | |
| 8 | 7 | < 0.1% | |
| 9 | 3 | < 0.1% |
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 3414380 | 2 | < 0.1% | |
| 2913780 | 2 | < 0.1% | |
| 2226316 | 1 | < 0.1% | |
| 2201113 | 1 | < 0.1% | |
| 2165569 | 2 | < 0.1% | |
| 2066667 | 1 | < 0.1% | |
| 2032329 | 1 | < 0.1% | |
| 1421489 | 1 | < 0.1% | |
| 974089 | 1 | < 0.1% | |
| 782295 | 1 | < 0.1% |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
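As a minimal sketch of the calculation described above (not the report generator's own implementation), r can be computed with NumPy by dividing the covariance by the product of the standard deviations. The function name `pearson_r` and the sample values are illustrative assumptions, not taken from the dataset.

```python
import numpy as np


def pearson_r(x, y):
    """Pearson's r: covariance of x and y over the product of their standard deviations."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    # Population covariance; the 1/n factor cancels against the population
    # standard deviations below, so the sample form gives the same ratio.
    cov = np.mean((x - x.mean()) * (y - y.mean()))
    return cov / (x.std() * y.std())


# Illustrative values only (hypothetical, not rows from the dataset).
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.0, 1.0, 4.0, 3.0, 7.0])

r = pearson_r(x, y)
# Shifting and rescaling either variable by a positive factor leaves r unchanged.
r_rescaled = pearson_r(3.0 * x + 5.0, 0.5 * y - 2.0)
print(r, r_rescaled)
```

The second call illustrates the location and scale invariance mentioned in the text: both values agree up to floating-point rounding.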
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and +1 indicating total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
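The rank-based calculation above can be sketched with pandas as follows. This is an illustration under stated assumptions, not the code that produced this report: `spearman_rho` is a hypothetical helper, ties are given average ranks (the default of `Series.rank`), and the sample values are made up.

```python
import pandas as pd


def spearman_rho(x, y):
    """Spearman's rho: Pearson's r applied to the ranks of x and y."""
    rx = pd.Series(x, dtype=float).rank()  # tied values receive their average rank
    ry = pd.Series(y, dtype=float).rank()
    # Covariance of the rank variables over the product of their standard deviations.
    return rx.cov(ry) / (rx.std() * ry.std())


# Illustrative values only: y is a nonlinear but strictly increasing function of x.
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]

rho = spearman_rho(x, y)                 # perfectly monotonic increasing relation
rho_reversed = spearman_rho(x, y[::-1])  # perfectly monotonic decreasing relation
print(rho, rho_reversed)
```

Because only the ordering of the values matters, the cubic relationship here reaches the extreme values of ρ even though it is not linear.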
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation and +1 indicating total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
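The pair-counting definition above corresponds to the variant usually called tau-a, which can be sketched in plain Python as below. The helper name `kendall_tau_a` and the sample values are illustrative assumptions. Library implementations often default to a tie-adjusted variant (tau-b), so results may differ on columns with many repeated values, such as the zero-heavy amounts profiled here.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """Kendall's tau-a: (concordant - discordant) / total number of pairs."""
    concordant = 0
    discordant = 0
    pairs = list(combinations(range(len(x)), 2))
    for i, j in pairs:
        # A pair is concordant when x and y order the two observations the same way.
        sign = (x[i] - x[j]) * (y[i] - y[j])
        if sign > 0:
            concordant += 1
        elif sign < 0:
            discordant += 1
        # sign == 0 means a tie in x or y; tau-a counts it as neither.
    return (concordant - discordant) / len(pairs)


# Illustrative values only: one of the six pairs is out of order.
x = [1, 2, 3, 4]
y = [1, 3, 2, 4]

tau = kendall_tau_a(x, y)
print(tau)
```

With four observations there are six pairs; five are concordant and one is discordant, giving τ = (5 − 1) / 6.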
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
First rows
| | Published Date | Project Type | Project Type Description | Budget Line | Budget Line Description | Funding Type | Number of Years Presented | First Fiscal Year | Fiscal Year 1 Amount | Fiscal Year 2 Amount | Fiscal Year 3 Amount | Fiscal Year 4 Amount | Fiscal Year 5 Amount |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 20160426.0 | HD | HOUSING & DEVELOPMENT | HDKN525 | NaN | CITY | 5 | 2016.0 | 0 | 500 | 0 | 0 | 0.0 |
| 1 | 20160426.0 | E | EDUCATION | E M4001 | FIT | CITY | 5 | 2016.0 | 40 | 150 | 0 | 0 | 0.0 |
| 2 | 20160426.0 | AG | DEPARTMENT FOR THE AGING | AGDN100 | CHINESE-AMERICAN PLANNING COUNCIL | CITY | 5 | 2016.0 | 503 | 0 | 0 | 0 | 0.0 |
| 3 | 20160426.0 | AG | DEPARTMENT FOR THE AGING | AGDN145 | ELMCOR YOUTH AND ADULT ACTIVITIES, INC. | CITY | 5 | 2016.0 | 0 | 0 | 510 | 0 | 0.0 |
| 4 | 20160426.0 | AG | DEPARTMENT FOR THE AGING | AGDN169 | GLENRIDGE SENIOR CENTER | CITY | 5 | 2016.0 | 0 | 118 | 0 | 0 | 0.0 |
| 5 | 20160426.0 | AG | DEPARTMENT FOR THE AGING | AGDN184 | HEBREW HOME FOR THE AGED | CITY | 5 | 2016.0 | 1658 | 0 | 1149 | 0 | 0.0 |
| 6 | 20160426.0 | AG | DEPARTMENT FOR THE AGING | AGDN216 | JEWISH COMMUNITY COUNCIL OF GREATER CONEY ISLAND (JCCGCI) | CITY | 5 | 2016.0 | 119 | 331 | 0 | 0 | 0.0 |
| 7 | 20160426.0 | AG | DEPARTMENT FOR THE AGING | AGDN235 | LENOX HILL NEIGHBORHOOD ASSOCIATION | CITY | 5 | 2016.0 | 236 | 0 | 0 | 3596 | 0.0 |
| 8 | 20160426.0 | AG | DEPARTMENT FOR THE AGING | AGDN262 | MET COUNCIL ON JEWISH POVERTY | CITY | 5 | 2016.0 | 282 | 164 | 2907 | 257 | 0.0 |
| 9 | 20160426.0 | AG | DEPARTMENT FOR THE AGING | AGDN334 | PRESBYTERIAN SENIOR SERVICES | CITY | 5 | 2016.0 | 50 | 0 | 0 | 0 | 0.0 |
Last rows
| | Published Date | Project Type | Project Type Description | Budget Line | Budget Line Description | Funding Type | Number of Years Presented | First Fiscal Year | Fiscal Year 1 Amount | Fiscal Year 2 Amount | Fiscal Year 3 Amount | Fiscal Year 4 Amount | Fiscal Year 5 Amount |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 27368 | 20200416.0 | WP | WATER POLLUTION CONTROL | WP0247 | UPGRADE JAMAICA WATER POLLUTION CONTROL PROJECT | CITY | 5 | 2020.0 | 2030 | 0 | 0 | 0 | 0.0 |
| 27369 | 20200416.0 | WP | WATER POLLUTION CONTROL | WP0249 | UPGRADE TALLMANS ISLAND WATER POLLUTION CONTROL PROJECT | CITY | 5 | 2020.0 | 1970 | 10676 | 0 | 0 | 0.0 |
| 27370 | 20200416.0 | WP | WATER POLLUTION CONTROL | WP0269 | CONSTRUCTION, RECONSTRUCTION OF PUMPING STATION/FORCE MAINS, CITYWIDE | CITY | 5 | 2020.0 | 18757 | 142873 | 38261 | 201565 | 42414.0 |
| 27371 | 20200416.0 | WP | WATER POLLUTION CONTROL | WP0269 | CONSTRUCTION, RECONSTRUCTION OF PUMPING STATION/FORCE MAINS, CITYWIDE | NON CITY | 5 | 2020.0 | 1060 | 1980 | 0 | 0 | 2700.0 |
| 27372 | 20200416.0 | WP | WATER POLLUTION CONTROL | WP0282 | ENG., ARCH., ADMIN. AND OTHER COSTS, DEPT. OF WATER RESOURCES | CITY | 5 | 2020.0 | 65348 | 34421 | 23961 | 53062 | 40590.0 |
| 27373 | 20200416.0 | WP | WATER POLLUTION CONTROL | WP0283 | UPGRADE NEWTOWN CREEK WATER POLLUTION CONTROL PROJECT | CITY | 5 | 2020.0 | -3623 | 3039 | 0 | 0 | 0.0 |
| 27374 | 20200416.0 | WP | WATER POLLUTION CONTROL | WP0284 | CITY-WIDE SLUDGE DISPOSAL FACILITIES | CITY | 5 | 2020.0 | -943 | 0 | 0 | 0 | 0.0 |
| 27375 | 20200416.0 | WP | WATER POLLUTION CONTROL | WP0285 | BIONUTRIENT REMOVAL FACILITIES, CITYWIDE | CITY | 5 | 2020.0 | 1429 | 1000 | 2000 | 19364 | 0.0 |
| 27376 | 20200416.0 | WP | WATER POLLUTION CONTROL | WP0287 | UPGRADE CONEY ISLAND WATER POLLUTION CONTROL PROJECT | CITY | 5 | 2020.0 | -29 | 0 | 0 | 0 | 0.0 |
| 27377 | 20200416.0 | WP | WATER POLLUTION CONTROL | WP0288 | UPGRADE OWLS HEAD WATER POLLUTION CONTROL PROJECT | CITY | 5 | 2020.0 | -531 | 0 | 0 | 0 | 0.0 |